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SYSTEM AND METHOD FOR BUILDING USER PROFILES 

Cross-Reference tn Rftlati^ A pplicfltinne 

This application claims the benefit of U.S. Provisional Applications No. 
60/076,179 and 60/076,404 having a common filing date of Febniaiy 27, 1998. 

Field of the Tnvmrinn 

The invention relates to systems and methods for monitoring and measuring flie 
interests of a user viewing content on a computer network, in particular on multiple 
servers in an enterprise network, while protecting the privacy of the user. 

Background nf the Tnvptirinn 

It is useful for vendors who sell items through the Internet to be able to compile 
sophisticated marketing data that indicates users' interest in the vendors' Web pages: 
Interest in a particular page may be determined by counting the number of "hits" on that 
page (i.e., the number of times the page is accessed) or by combining a count of the 
number of hits with data indicative of the amount of time users spend viewing the page. 
The number of hits for a page and the amount of time spent by usere viewing a page are 
both determinable using conventional techniques. 

Although measurement of users' interest in each of a vendor's pages is useful, 
additional useful information may be obtained by correlating these measurements on a 
per user basis. Thus, a vendor may learn that a fust type of user that is interested in 
page A is also usually interested in page B, while a second type of user that is interested 
in page C is also usually interested in page D. Such information allows the vendor to 
customize his Web pages on the fly for each user so that a user that initially selects 
particular pages is presented with the opportunity to select more of the type of pages in 
which tfiat user is expected to be interested. 
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However, in many instances, a user accessing the Internet jumps fmm server to 
server. Unless diffisrent vendors on different servers agree to cooperate, it is extremely 
difficult, if not impossible, for a first vendor on a first server to know that a user who 
accessed a particular one of the first vendor's pages also accessed a particular page of a 

5 second vendor on a second server. In addition, diere is no built in mechanism on the 

Internet for globaUy identifying users so that vendors on different servers can share such 
information. Also, even in instances where a group of cooperating vendors have 
adopted a cross-serveruser identification scheme that globally identifies the users to the 
vendors, it is questionable whedier such schemes violate users' privacy since each of the 
10 participating vendors is exchanging information about usets that the users might not 

want to be shared. Moreover, such limited cooperative efforts at cross-server 
identification generaUy employs a single identifier for each user. However, cross-server 
identification schemes tiiat employ a single identifier have disadvantages. For example, 
should one of the vaidois stop collaborating witii tiie others, such as because of an 
organizational or business change, issues of ownership and access to information tied to 
a shared identifier can arise. 
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Accordingly, it is an object of the invention to provide a distributed 
identification scheme which allows individual servers to control tiieir own local 
20 identification scheme and to collaborate with otiier servers at its manager's discretion. It 

is yet anodier object of the invention to compile page usage information about users 
accessing multiple savers in an enterprise network without potentially violating the 
privacy of tile users. 

25 Summaiynf%TnvHtfifin 

A distributed user identification process is provided that allow individual local 
servers or domains to control tiieir own user identification scheme and to collaborate 
with otiier local servers or domains at tiie discretion of an enterprise server. Tht 
enterprise server correlates tiie local user identification scheme witii a global user 

30 identifier and may disclose to interested outside parties, such as advertisers, only tiie 

global user identifier without revealing flie identity of a user who interacts wifli a local 
server. 
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Also provided is a process for compiling anonymously a global user profile from 
local user profiles generated by die local setvets. 

In general, according to one aspect of the invention, a computer network 
includes at least one local server and an enterprise server in communication with the 
local server. The local server ^blishes a local ID for the user and communicates to the 
enterprise server die local ID of the user and a local user profile based on user 
interaction with the local s«ver. The enterprise server links the local ID to a global ID 
assigned to the user by the enterprise server and records in a database the information 
about the local user profiles to form the global interest profile of the user. 

Preferred embodiments nmy include one or more of the following features. The 
global ID may be known only to die enterprise server and may be kept secret fcom die 
local servers. User information recorded in the enterprise database may include die local 
ID of die user. The local user ID assigned by one local server may be hidden from die 
odier local servers. The local user profile may be communicated to the enterprise server 
at predetermined times and/or when a number of changes made to die local profiles are 
greater dian a predetermined number of changes. The global ID may be assigned to the 
user direcdy by die enterprise server when die user first accesses die enterprise server. 
Alternatively, die global ID may be assigned to die user when die user accesses one of 
the local servers and die local server communicates die local ID of the user and possibly 
also a local user profile to die enterprise server. Tlie local ID and die global ID may be 
persistent and include state information. Hie state information may be communicated 
between die usct and die local server and die enterprise server widi die help of cookies. 
The local server may communicate die local user ID to die enterprise server by 
transmitting oa an HTML page a URL ^ch may include a gtzphk symbol of zero 
widdi and height, or by temporarily redirecting die URL selected by the user to a local 
URL. Transmission of the URL may be transparent to die user. 

A global interest profile may be established for each user of at least a subset of 
users and die global interest profiles between different users may be compared At least 
one score may be computed for a user and die score of die user may be compared to a 
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conesponding score of another user. He scores may represent an absolute number 
score, me local user profile may established incrementally by adding information about 
a most recent user interaction to a legacy user profile stored at the local server. TTie user 
profile may be processed in real time and weighted according to the recency of the user 
interaction with the local server. He global user profile for a phnality of users may be 
updated in a single pass. 

In another aspect of the invention, a computer ^paratus for establishing a global 
interest profile of a user includes at least one local server in communication with th^ 
user via a communication chamiel wherein the local server assigns a local ID for the 
user during the first access by the user to die local server. An enterprise server 

communicates with the userandthe local serverviathecommunication chamiel and 
assigns a global ID for the user. The local server commmucates to the enterprise server 
the local ID of the user and possibly also a local user profile based on user intemction 
witii the local server. The cnteipiise server links the local ID to the global ID and 
records in a database information about die local ID and, if desired, also the local user 
profile to form a global interest profile of the user. 

In yet another aspect of the invention, a method momtors interactions between a 
cUent and a plurality of servers communicating with one another in a computer network 
by designating one of the servers as an enterprise server and die remaining servers as 
local servers. The local server, upon interaction with the client, establishes a local ID for 
the cUent and communicating at least the local ID of the client to the enterprise server. 
When the enterprise server receives fix)m the local server the local ID of the client or 
when the client interacts directiy with die enten«ise server . the enterprise server assigns 
a unique global© to die cUent and links the local ID with the global ID. 

Preferred embodiments may include one or more of the foUowing features, llie 
enterprise server and the local servers may form an enterprise group. Hie client may 
receive state information fixim the server upon interaction with the server and may 
transmit the state inforaiation during a subsequent interaction with the server. Likewise, 
tbe local server may receive fiom die enterprise server state information related to the ' 
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QlmU a&d may transmit the state informatira during a subsequent interaction wiA the 
enterprise server that relates to &e same client The state information may be persistent 
and stored in form of a cookie. 

In yet another aspect of the invention, a computer program residing on a 
computer-readable medium inchides mstructions for causing an enterprise server to 
establish a unique global ID for a client and link the global ID with a local ID associated 
with the client on a local server. The program may also form a global interest profile of 
the client based on local interest profiles compiled by the local server. 

Brief Description Of Thg Prayjnes 

Hie forgoing and other features and advantages of the present invention will be 
^reciated more fiilly fiom the following fiirther description thereof, witii reference to 
the acconq)anying drawings wherein: 

FIG.l is a fimctional block diagram a computer network; 

FIG. 2 is a flowchart of a process operating on a local server for establishing a 
local user profile; 

FIG. 3 is a flowchart of a process operating on an enteiprise server for linking a 
local usCT ID to a global us^ ID; 

nG.4 is a flowchart of a process for building a global user profile on the 
enterprise servo:; 

FIG. 5 is an cmapit of a user's session record; 

FIG. 6 is a flowchart of a process operating on the enterprise server for creating 
a multi*user profile; and 

FIG. 7 is a block diagram of client interaction with die servers. 
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Detailed Description nf the nins tiated PnUffl rtimfinff} 

To provide an overall understanding of the invention, certain illustrative 
embodiments wiU now be described. However, it will be understood by one of oidinaiy 
skiU in the art that the systems described herein can be adapted and modified to provide 
5 systems for other suitable applications and that other additions and modificati^ 

made to the invention without departing from the scope hereof 

Referring now to HGS. 1 and 7,apart 10 ofthe Internet computer network 
includes a client 12 and a group of servers 14-1 8. The client 12 may be any one of a 
10 variety of conventional, commercially available, hardware and software combinations 

configured to access Internet servers by any one of a variety of suitable means. 

Similariy, the servers 14-18 may also be any one of a variety of conventional, 
commercially available, hardware and software combmations configured to provide 
conventional Internet services to users. Conventionally, the particular hardware and 

15 software combination used by any one ofthe servers 14-18 is independent of die 

particular hardware and software combination used by any other one ofthe servers 14- 
18. In some instances such as those described below, the conventional server software 
is supplemented to provide the fimctionality discussed herein. The servos 14-18 and the 
client 12 communicate with each other via communication links 24, 25 and 29 which are 

20 all connected to a communication channel 28. 



For the system described herein, a subset ofthe servers 15-17 form an enterprise 
group 22 that monitors and measures users' page usage among all ofthe local servers 
15-17 ofthe group 22. One of the servers 16 is designated as die "enterprise server^ for 
the group 22 while the other servers 15 and 17 that arc part ofthe group 22 are 
designated as "local servers". Communication links 26 shown in FIG. 7 between the 
local servers 15 and 17 and the enterprise server 16 ofthe enterprise group 22 are not 
physical communication links, but are intended to illustrate the information exchange, 
such as infomiationaboutthe users' page usage, between the servers 15 and 17 and the 
enterprise server 16. ITie client 12 can also communicate direcUy with the enterprises 
server 16, as indicated in HG. 7 by communication link 27. Communication links 26 
and 27 can also be used to exchange state infonnation, as discussed below. In the 
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exemplaiy«mbodiment of FIG. 1, the savers 14 and 18 are not part of tfie entoprise 
group 22 and therefore do not exchange usage information with the enterprise sHvers 
16. TTie servers 14 and 18, however, can still be accessed by flie client 12. 

Note that the term "locaT is meant to convey tiie relationship of tiie servers 15 
and 1 7 within the enterprise groiq) 22. The local servers 15 and 1 7 can be accessed by 
users in tiie same manner tiiat other conventional Intanet servers are generally 
accessible. The following discussion will be limited to the servers 15-17 which are part 
of the enterprise group 22. 

As indicated in FIG. 7, the cliat 12 may access any of tiie servers 15-17 by: (1) 
establishing a connection to tiie server, e.g., in an Intanet connection, by entering tiie 
server's URL (Uniform Resource Locator) address www . server# . com; (2) using tiie 
established connection to provide tiie server witii a request for specific data; and (3) 
receiving tiie requested data fmm tiie server via tiie connection. The Internet location 
- may also include, e.g., appended to die URL, tiie server location of tiie data, tiie file(s) 
on tiie server tiiat contain tiie data, and tiie type of tiie data (i.e., graphic image, HTML 
page, etc.). 

The client 12 requests HTML pages from tiie servers 15-17 tiiat, when displayed, 
may include user-actuatable links to otiier HTML pages. The otiier HTML pages may 
be on die same server as tiie displayed HTML page, or may be on a different server. In 
many instances, actuating a link to an HTML page causes tiie user to transfer from one 
server to anotiier in a manner tiiat is, in many respects, tiMsparent to tiie user. 

The local servers 15 and 17 upload information about users' page accesses to tiie 
enterprise server 1 6. TTie enterprise server 1 6 combines tiie information for each user so 
tiiat, for example, it is possible to correlate accesses by a particular user of Web pages 
on tiie local server 15 witii accesses by tiie same user of Web pages on tiie local server 
1 7. However, as seen in FIG. 7 and described in more detail below, it is tiie enterprise 
server 16, raflier tiian tiie local savers 15 and 17, tiiat correlates tiie cross-server 
information on a per user basis. Each of tiie local servers 15 and 1 7 uses a "local" D for 



eachuswthatacccssestbelocalscrversl5andl7dire^ Tie local ID's are different 
for each of the servers 15 and 17 so that the local servers 15 and 17 are prevented fiom 
direcUy sharmg or correlating information about a particular user without assistance 
fiom the enterprise saver 16. 

Since the HTTP protocol is stateless, the servers and any gateway program on 
the servers retain no knowledge of any previous transactioa Without persistent state 
information, the server wiU not be able to identify the client and/or obtain information 
about the cKent Likewise, without state information, the local servers 15 and 17 will not 
be able to communicate local information about the client to the enterprise server 16. 
The exemplary system described herein employs cookies to preserve state information. 
However, any other mechanism that preserves state information can be used. 

The features of cookies are described in. for example. TheHTMT .w.|yfnk 
third edition, by Ian S. Graham, published by Wiley & Sons, he, 1997. By way of 
background, cookies represent one possible mechanism for storing istate and/or 
identificatioii information on a user's local server 12. The server accessed by the user 
sends the cookie information to the user via a conventional command capable of 
transferring cookie information from the server to the user. Thereafter, whenever the 
user requests data from the server that set the cookie, the user request also includes the 
cookie previously sent by the server to the user. The command that sets the cookie 
causes the cookie information to correctly identify the server that executes the 
command. In addition. generaUy, a cookie is only sent to a server that set the cookie. 
Thus, if a particular server sets a cookie, the cookie includes information indicating diat 
it was set by the particular server. Tliere are mechanisms in place to prevent tfiat cookie 
from being sent to any other server. 

In the exemplary embodiment, each of dje local servers 15 and 17 assign tbeir 
own unique persistent state information to the client 12 in form of a local ID. The 
enteiprise server 16 assigns a secret persistent state information to the client 12 in form 
of a "global" ID and correlates the global ID with each of the unique local ID's assigned 
by each of the local servers 15 and 17. All of the ID's are made persistent using 
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cookies. The local sever IS sets a codde containing a unique local © for the client 12 
assigned by the local server 15. The client 12 subsequently provides the assigned local 
ID to the local sever 15 each time the cliat 12 requests data (e.g., an HTML page) 
fiom the local server 15. Thus, the local server 15 is provided with a basis for knowing 
the identity of the client 12 each time die client 12 requests data fiom die local server 
1 5. Similarly, die local server 1 7 sets a cookie containing a unique local ID for die 
client 12 (unrelated to die local ID assigned by die local server 15) that die client 12 
subsequendy provides to die local server 17 each time die client 12 requests data fiom 
the local server 17. 

Information regarding die data requests and die associated local ID's are 
provided by die local servers 15 and 17 to die enterprise server 16. The local servers, 
however, do not reveal die true ideitity of die user to die enteprise server 16. The 
enterprise server 16 can map differoit local ID's for die same user to die single, secret, 
global ID. Thus, die enterprise server 16 is in a unique position to correlate cross-server 
information about users while die local servers 15 and 17 can not directiy correlate 
cross-server information because neidier of die local servers 15 and 17 possesses die 
secret global identifier assigned by die enterprise server 16. 

Referring now to FIG. 2, a flow chart 30 illustrates an embodiment of die 
process of die invention based on software operating on die local servers 15 and 17. The 
process 30 begins at a test step 32 after a data request has been submitted by die client 
12. At die test step 32, it is determined whedier die client 12 has ever requested data 
from die particular local server prior to die current request Note diat, as discussed 
above, if die client 12 had ever accessed die particular local server, dien die client 12 
would have a cookie diat had been set previously by die particular local server. Thus, if 
die dient 12 does not provide a cookie widi die data request, dien it is deteimined at die 
test step 32 diat die client 12 has never accessed die local server and control passes from 
step 32 to step 34, where die local server creates a unique local ID for die client 12. The 
local server can generate unique local ID's in a variety of conventional manneis familiar 
to one of ordinary skill in tiie art, including, but not limited to, incrementing a stored 
value and dien providing an alphanumwic version of diat value as die local ID. 



Following stq) 34 is a stq) 36 where the local server forces a transfer to the 
enterprise server 16 (i.e., the client is transferred to the enterprise server 16)- Generally, 
a variety of conventional techniques exist for forcing a transfer to another server. For 
example, the local server may use conventional techniques to insol a special URL into 
the HTML page requested by the client 12, Hie special URL points to the enterprise 
server 1 6 and calls for insertion of a graphics image have zero width and height The 
special URL may also contain additional information, such as information identifying 
die local server and information indicating the local ID of die user. The additional 
information may be appended to the end of the URL in the form of 

http://enterprisejerverjd/go?local_sen^erjd4telient_informatio This process may be 
transparent to the user. 

Alternatively, redirection may be used to transfer the user to the enterprise 
server. Redirection involves providing an HTTP response message to die browser 
which forces the browser to look for a different URL. Hie local server redirects flie 
client 1 2 to the enterprise server 1 6 by, for example, returning the location of the 
enterprise server 16 in die form: 
location: server_url continents 

Browsers that understand die location field will automatically connect to the URL of 
the enterprise server 16. 

The forced transfer serves to effectively "register^* die local ID of die client 12 
widi die enteiprise server 16. As mentioned above, die forced transfer can be 
transparent to die client 12. Processing at die enteiprise server 16 in response to a 
forced call is described in more detail hereinafter. Note, howev©-, diat once die 
enterprise server 1 6 has completed die processing, die client 12 is returned to the local 
server. In die case of using redirection, die enterprise server 16 simply redirects die 
client 12 back to die local server that die client 12 was accessing prior to being 
transferred to die enterprise server 16. Following die step 36 is a step 38 where die 
local server sets a local cookie for die client 12. The steps 34, 36, 38 are executed only 
once, i.e., die first time die client 12 accesses die local server. After diat, die client 12 
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wiU send to the local server the cookie set by the respective local server whenever the 
client 12 accesses the respective local servw. 



FoUowing the step 38, or following the step 32 if the local server receives a 
cookie from the cUent 12, is a step 42 where the local server compiles information 
based, for example, on frequency and duration of page accesses by the client 12. This 
infomiation may be compiled by, for example, providing a plug-in to the local server 
that inchides a conventional server API call to the plug-in each time a user requests a 
page. Thus, for example, the time duration that a user spends viewing page A may be 
determined by registering the time when a user requrats page A, registering the time 
when the user requests a subsequent page B, and calculating the difference between the 
two times to determine the duration. Following the step 42, processing is complete for 
the local server handling the page request of the client 12. As discussed in more detail 
below, the enterprise server 16, using a secret global ID known only to the enterprise 
server 16. combines the information provided by the local server with mformation 
relating to the same user from other local servers that is mapped to the various local 
ID'S assigned to a user by the different local servers. 

■ For the enterprise server 16 to be able to compile information about the users, it 
is necessary for the local servers to periodically forward to the enterprise server 16 the 
gathered information along with the local ID for each user and information identifying 
the local server. This may be accomplished using any one of a variety of conventional 
techniques. In a preferred embodiment, the local server formats the information as a 
plurality of HTML pages that are uploaded to the enterprise server 16 in a conventional 
mannerusingconventionalHTIPexchangetechniques. The local server initiates the 
transfer either when a local buffer of the local server exceeds a predetermined size, or 
after a predetermined amount of time has passed since a previous update of the 
enterprise server 16 by the local server. The predetermined size and the predetermined 
amount of time are chosen based on a variety of functional factors femUiar to one of 
ordinary skill in the art, including amount of storage available at the local server and hit 
rate of the local sCTver. 
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Referring now to FIG. 3, a flow chart 50 illustrates steps perfomed by the 
enterprise server 16 in response to a forced transfer by a local server, such as that 
illustrated by the step 36 of HO. 2. Processing begins at a step 52 where the enterprise 
server 16 receives the new assigned local ID as weU as the server identification fiom the 
calling local server. As discussed above, this information may be encoded in the special 
URL provided by die local server. However, other conventional techniques for 
conveying this information exist including, but not limited to. passed arguments, 
environment variables, and cookies passed between the local server and the enterprise 
scrvCT 16. 

Following the step 52 is a test step 54 which determines if the client 12 has ever 
accessed the enterprise server 16 (Le., the test step 54 determines if the client 12 has 
everaccessedanyofthe servers 15-17 ofthe group 22). TTiis can be determined using 
cookies where the enterprise server 16 sets a cookie and provides it to the client 12. 
Thus, if the enterprise server 1 6 docs not receive a cookie fiom the cUent 12, then it is 
determined at the test step 54 that the client 12 has never accessed the enterprise server 
16 and control passes from the test step 54 to a step 56 where a new global ID is created 
for Ac client 12. Note that die system is designed so that only one global ID is created 
for each user. FoUowing the step 56 is a step 58 where the global ID is forwarded to the 
client in die form of a cookie. The global ID may be created in any one of a variety of 
conventional manners femiliar to one of ordinary skill in the art, including, but not 
limited to, incrementing a stored value and then providmg an alphanumeric version of 
diat value as the global ID. 

If it is determined at the step 54 that die enterprise server 16 received a cookie 
fiom die cUent 12, then control passes fiom the test step 54 to a step 62 where die global 
ID, passed via die cookie, is mapped to die new local ID provided by die local server. 
Note tiiat die step 62 also follows die steps 56, 58 where die global ID is created and 
passed to die cUent 1 2. The mapping at die step 62 can be performed in a variety of 
conventional manners, including usmg an array indexed according to die local server 
and local ID and containing entries corresponding to die global ID. Alternatively, the 
mapping may be provided using an appropriate data structure, such as a linked list 
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having nodes indicating the local server, the local ID provided by the local servCT, and 
the corresponding global ED. Altonatively still, the mapping may be stored in a 
database having a plurality of records, each containing the global ID, the local ID, and 
the site identifier for the local server that assigned the local ID. In tiiis way, a single 
global ID may be mapped to multiple local ID's assigned by multiple local servers. 

Following the step 62, processing is complete for registering the new local ID of 
the client 12 with the enterprise server 16. After this registration process, the local 
servor does not force a transfer to the enterprise server 1 6 when the client 12 accesses 
the local servw. Instead, as discussed above, the local server compiles data which is 
subsequently transferred to the enterprise server 16. It is the enterprise server 16 that 
combmes all of the data fiom all of the local serves on a per user basis and makes the 
thus-compiled information available in a way that does not necessarily identify the 
client 12. 

The process illustrated in FIGS. 2 and 3 can be used to build a global user profile 
tiiat is compiled firom user interaction with the local servers 15 and 17 using die local 
user IDs and possibly also with the enterprise server 16. Typically, the process monitors 
several characteristics of the user's visit, such as, for example, die subject matter of die 
visited web pages and the duration of these visits. This collected information can be 
used to characterize the user's interest in a given interest category and to d^rmine what 
available content would be of interest to die user. The generated global user profile is 
administered by the enterprise servo- 16 and identified by die global ID of die user 
witiiout necessarily revealing die identity of die client 12. 

A user's interest behavior can be tracked over a history of Internet sessions, so 
tiiat a composite view of die user's interests can be generated. Additionally, die 
described processes may take into consideration die age of die collected behavior 
information, so tiiat older behavior information has less impact on an interest score than 
more recent behavior information. TTie processes may be sensitive to die duration over 
which each page is viewed and die generated interests scores are provided in an absolute 
interest scale, as opposed to a relative interest scale. This fecilitates raeaningfid 
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comparisons of interest levels between different iisers, and offers a powerful tool for 
identifying related interests for users having selected demographic characteristics. TTie 
processes described herein may also be applied incrementaUy and reduced to a set of 
paraUel operations to increase greatly the speed of analyzing the collected behavior 
informatioa It will be understood that the process may also operate as a stand-alone 
process on a single web site and, moreover, may not require the use of local and global 
IDs. 

The process for building a user profUe begins with the step of collecting useful 
information about the interest behavior of a user 12 looking at different content stored at 
an exemplary local server 15 (FIG. 1). This data collection step is shown as step 42 in 
HG. 2. The process 70 depicted in HG. 4 starts with step 72, during which the server 
15 determines that a session has begun with a particular user 12. As described above, 
the server 15 can make this determination by identifymg a cookie sent fiom the client 
12. If the process 70 finds a cookie then the process 70 determines that the user is 
known to the server 15 and begins coUecting data about the user's session. The process 
70 then generates a local session ID which can be a sfanple signal that identifies one 
series of related interactions between the user and the server 15. For example, for the 
first time the user contacts the server 15, the process can set the local session ID to 
000001. For subsequent visits, the process can increment the local session ID. Hie 
process keeps track of the last local session ID generated to ensure that new local 
session IDs are generated for each session. Other examples of methods for generating a 
session IDs are known in the art and any suitable method can be practiced with the 
process. 

In a subsequent step 76, the process builds a local session recoiiL This is 
typically done by analyzing the click stream generated by the user as the user looks at 
the content displayed on the web page or pages located on server 15. An example of 
one type of session record is shown in m. 5. Specifically, no. 5 shows a session 
record 90 that is stored in a database maintained by the server 15. The depicted session 

record 90 is assodated with a local user ©, as the process in this example mainto 
record of each of the sessions the user has had with the respective server 15 since 
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infonnationabom the user was last uploaded to flieenterp Itwfllbe 
apparent to one of ordinary skill that FIG. 5 provides an example of a session record, but 
that other formats can be employed wi A the process described herein. 

FIG. 5 further depicts that each session record can store a list of the web pages 
viewed by the user while visiting the server 15 and other information about the content 
of these web pages. Further, for each page viewed, the process can store, for example, 
information of the type shown in the page block 96, which includes a list of Interest 
categories, Int^Cat, associated with the page and a list of content interest scores, C, that 
represents how strongly the content of the viewed page is related to the interest 
categories; a date time stanq), t, which gives a statement of the time and date on v^diich 
the page was accessed by die user, and d, a measure of the duration for which the page 
was viewed by the user. Page block 96 shows that each page can be associated with one 
or more mtercst categories, such as Int_^Catl, Int_Cat2, and Int_Cat3, etc. Moreover, a 
dififerent content interest score can be given for each interest category associated with 
^Ae page. The process can store a page block for each page viewed during a session. 
Two page blocks, 96 and 98 appear in FIG. 5, however, any number of page blocks can 
be stored depending on the number of pages viewed by the user. 

Although data collection has been described for a user*s activities at one web 
site, it will be understood that the data collection process 70 can occur on a number of 
different web sites. This means that one web site dedicated to providing sports 
information can collect information about the users favorite sports and favorite teams, 
while another web site that offers books for sale, can collect information about the 
user's fevorite categories of books. Accordmgly, a wide range of the user's interests 
can be captured. 

Returning to FIG. 4, the process 70 proceeds from step 76 to step 78 when an 
"end of session" is detected This may occur, for example, when the user 12 fails to 
access a page on the server 15 within a preset time, for example thirty minutes. 
However, it will be apparent that other techniques for determining the end of session can 
be used. In step 80, the process 70 stores the session record into a local database of 
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session records. Then, at a selected time, such as once a day, the process 70 in step 82 
will upload the contents of the local database of server 15 to a global database of the 
enterprise server 16. This provides the enterprise server 16 with click stream 
mfonnation representative of the interests of the user 12 during the user's session on 
server 15. The information can be processed and assembled for generating a global 
interest profile for that user. As discussed above, the interest information can be 
combined with the user's demographic, geographic and other suitable mfoimation to 
build a user profile of the user. Likewise, this type of click stream data may also be 
uploaded from the other saver 17 of the enteq)rise group 22, 

Referring now to FIG. 6 and Tables 1 and 2, once the information gathered on 
server 15 reaches the enterprise server 16, an enterprise process 120 running on 
enterprise server 16 can analyze the information to generate and update interest profiles 
for the user. Table 1 shows the variables and the pseudo-code of the enterprise process 
120; Table 2 lists the equations used in the enterprise process 120 and referenced in 
FIG. 6. The pseudo-code of Table 1 includes comments that describe the variables 
appearing in the code. 
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Variables 


Pseudo-code 


/ : interest behavior vector of four 


Declare types L A 


dimensions: (c, t,d,s) for a given interest 


Define scalino fimrHrm f*/ \ 

A^vA&uv aboiiilg iliUvUUn J 


category 


Define raw intprpct fimr^rm f / \ 


A : array of interest behavior vectors / 


'^vxAuv luoiuiicdj uiicresi luncDon XvA ) 


for a given user 


For each user 


c ; content interest score (range 0,00 


For each catecrnrv 


to LOO) 


If incremental 


i : date/time in seconds (since 1/1/96 


uivii \uuinpuie uisioncai 


00:00:00) 


iuicrc5i voniriDUnon USmg tj/ 


. yu^c vi^wc^u m ssconos 




(compensated for download time) 


^ompuie r lor current period 


S ' session id 


using f^A) 


r : raw interest score 






Compute i using fiC^AjHffcO,, g) 


0.00 to 1.00) 




tg : date/time of prior profiling 


vjuipul inierest score i 


algorithm execution 




tc : current date/time 


Save / as / for neirt pYpnitinn 


P : duration coefficient parameter 




y : age coefficient parameter (half life 




of interest score in seconds) 




K : exponential decay constant 





Table 1 
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The functions referenced above are set forth below in equations 1 tiirou^ 9: 

L f,(A)=/ 

1 



2. fi(x)=y=l. 



3. fr'(y)=x= 



l+;t 



1 



^.<1 —- 1 
l-y 

y=l 200 



4. i = f,(f,(A)) = l. 



1 



5. r = f,(A) = 2; 



I+f,(A) 



= 1 



(Zf.(^)c)(2) r'^ =(2)- r 2| f,(2:f,(p<<)c)(2)f 



6. A=A,uA2 

^■ f.{'-..',)=r(0(2) ^ 



& r=fXA)=f,U)+f,(/„0=f,(4)+^-(,-)(2) r 

9. f,(A) = / = f,(f,(A)) = f,(f,U) + f.(,-, .(.)) 



Table 2 
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The pseudo-code listed in Table 1 can proc^ infonnation, in particular click 
stream data, collected about a plurality of users 12 having visited a phirality of web 
pages located on servers 15, 16, 17. In the enterprise process listed in Table 1, die 
process generates for each user 12 an array, A, of interest behavior vectors, /. wherein 
each behavior vector is associated with a given interest category. Each of the vectors / 
can be a multi-dimensional vector, such as the four dimensional vector (c. /. d. s), 
wherein c, rf and 5 are the parameters provided by tiie click stream data genwated at a 
web site for a given user. As discussed with reference to FIG. 5, c is representative of 
the content interest score for a page stored at the server, t is representative of the 
date/time expressed in seconds, when the page was last viewed by the user, measured 
from a reference date/time, e.g., 1/1/1996; d is representative of the duration of time for 
which the page was viewed and is typically provided in units of seconds; and s is 
rq)resentative of the session ID. 

Referring back to FIG. 6, the enterprise process 120 operates on an interest 
category-by- interest category and a user-by-user basis and, to that end, transitions a 
loop 122 that selects interest categories one at a time and begins to determine the user's 
interest profile for that interest category. 

Once an interest category is selected for a specific user, e.g. "User 1" of FIG. 6, 
a first step 126 determines if there has been a previously determined interest score, i, for 
the selected interest category. If no such historical score exists for the interest category 
of the user, then the process 120 will compute in step 128 an initial interest score for the 
selected category I. 

The initial interest score may be computed, for example, bom information c, d, t, 
and s, provided by Ae click stream data. Equation 5 provides one technique for 
processing the click stream data. 

Equation 5 is applied in step 128 to compute a raw interest score r by summing 
over all sessions s, taking into consideration the content interest score c, a duration 
coefficient parameter fi, and the duration d during which the selected categoiy of 
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content was displayed to the user during one session. The duration coefficient 
parameter prelates to the time of observation versus the time of the computation. P 
increases when less computation takes place during the observation. Equation 5 also 
applies a scaling function, f„ defined in Equation 2 that scales, or nonnalizes, certain 
values in Equation 5, such as 4/, to a number between zero and one. Finally, Equation 
5 also sums and normalizes the user's interest over all observed sessions and multiplies 
the session record with an exponential function which takes into consideration the 
relevance of older data. The yield of Equation 5 is a measure of the user's raw interest 
in a selected category. 

If it is determined in step 126 that an interest score already exists for the 
specified user, then the process 120 goes to stq) 130 to determine the extent to which 
the previously determined historical interest score should effect the new interest score. 
In one practice, this historical contribution i, is determined from Equation 7. As seen 
from Equation 7, the historical interest score / is multipUed by the inverse interest 
scaling function f,' to remove the effect of the previously appUed scaling process and to 
remove the effect of the exponential time decay function of Equation 5 applied during 
the previous session. After the interest profile for the current sessions is determined 
using Equation 5, step 132, the historical contribution i, determined in step 130 and the 
current raw interest score determined in step 132 are added (Equation 8) and normalized 
(Equation 9) to form a combined tq)dated interest score / , stq) 1 34. 

The new interest score can be stored for the user and the process will move to 
another interest category for die user, steps 136 and 142. Once the information for all 
requested interest categories for the users has been processed, the time of last execution. 
t„ is updated to tiie current time, t„ and saved, step 138. The process can then be 
repeated for anodier user, step 140. 

As discussed above, each of die interest behavior vectors, /, can be associated 
with a given interest categoiy. An interest categoiy can be any subject, topic, concrete 
or abstract concept, e.g. sports, music, politics, general news or history. One or more 
categories can be associated with a page stored at a server site and made available to a 
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user. In practice, each page stored at a site can be manually, or automatically assigned a 
given category or a plurality of given categories each of which is associated with a page. 
For example an HTML page stored at a server can contain information about a movie 
describing the Civil War. In this example, this page can be associated with an interest 
categoiy related to movies, as well as an interest category related to history, or the Civil 
War. The content interest score can be nomialized so as to range in value between 0.00 
and 1.00, The process 120 can also be used to assign a score to a page. Accordingly, a 
page associated with the interest category can also be associated a content interest score 
that represents how closely associated the selected page is for a particular category or 
categories. 

In another practice, the user profile generated by the systems described herein 
can also include demographic or geographic information which is collected through 
conventional means, typically a form the user fills out when visiting the web site. The 
user profile comprises both demographic information and interest behavior information. 
: It will be understood that the combination of these two types of data can allow for 
certain types of analysis such as the interest levels of certain demographic groups in 
certain subject matter to provide detailed market analysis and to identify link interest 
between seemingly disparate subject matters. Other advantages of the systems 
processes described herem will be apparent to those of ordinary skill in the art 

As seen fix)m FIG. 6, the enterprise process 120 described above generates a user 
profile and/or updates an existing user profile for a specified interest category in a single 
pass. The user profile can be incremented in real time and new interest categories can be 
easily added. Moreover, the interest measures can be provided as absolute scores, 
allowing comparisons between the interest scores of different users. Additionally, the 
information is provided in a human readable form that allows ready access to the data 
derived firom a user's interest behavior. 

The invention can be implemented in digital electronic circuitry, or m computer 
hardware, firmware, software, or in combinations of them. Apparatus of the invention 
can be implemented in a computer program product tangibly embodied in a 
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machine-readable storage device for execution by a programmable processor, and 
method steps of the invention can be performed by a programmable processor executing 
a program of instructions to perform (unctions of the invention by operating on ittpxn 
data and generating ou^uL The invention can advantageously be implemented in one or 
more computer programs that are executable on a programmable system including at 
least one programmable processor coupled to receive data and instructions ftom, and to 
transmit data and instructions to, a data storage system, at least one input device, and at 
least one output device. Each computer program can be implemented in a high-level 
procedural or object-oriented programming language, or in assembly or machine 
language if desired; and in any case, the language can be a compiled or inteipreted 
language. Suitable processors include, by way of example, both general and q)ecial 
purpose microprocessors. Generally, a processor will receive instructions and data ftom 
a read-only memory and/or a random access memory. Storage devices suitable for 
tangibly embodying computer program instractions and data include all forms of non- 
volatile memory, including by way of example semiconductor memory devices, such as 
EPROM, EEPROM, and flash memory devices; magnetic disks such as internal hard 
disks and removable disks; magneto-opUcal disks; and CD-ROM disks. Any of the 
foregoing can be supplemented by, or incorporated in, ASICs (application-specific 
integrated circuits). 

To provide for interaction witfi a user, the invention can be implemented on a 
computer system having a display device such as a monitor or LCD screen for 
displaying information to the user and a keyboard and a pointing device such as a mouse 
or a trackball by which tiie user can provide input to the computer system. The 
computer system can be programmed to provide a graphical user interface tiirough 
which computer programs interact witii users. 

While the invention has been disclosed in connection with the preferred 
embodiments shown and described in detail, various modifications and improvements 
thereon will become readily apparent to fliose skilled in the art The local user E) may, 
for example, be based in user input, such as a membership ID assigned by an 
organization, or may be selected by the user on a form submitted to the server upon 
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logiiL The enterprise senrer may parse tiie global user profile and release to an interested 
party, such as an advertiser, only those portions that are of interest to the advertiser 
and/or to whidi die advertiser subscribes. Accordingly, the spirit and scope of die 
present invention is to be limited only by the following claims. 
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Claims: 

1 . A method for monitoring interactions between a client and a plurality of 
servers conmiunicating with one another in a computer network, comprising: 

designating one server of the plurality of servers as an 
enterprise server and the remauiing servers as local servers; 

a local server, upon interaction wiA the client, 
establishing a local ID for the client and communicating at least flie local 
ID of die client to the enterprise server, 

the enterprise server assigning a unique global ID to the client 
upon at least one of: 

(i) receiving from the local server the local ID of the client, 

and 

(ii) the client interacting directly with the enterprise server; 

and 

the enterprise server linking the local ID with the global ID and 
storing the linked local ID and global ID in a database. 

2. The method of claim 1, wherein the local server further communicates to 

r 

the enterprise server a local user profile based on user interaction with the local servo", 
and wherein the enterprise server stores in die database the local user profile to form a 
global interest profile of the user. 

3. Hie method of claim 1 or 2, wherem the global ID is kept secret from the 
local servers. 

4. The method of claim 1 or 2, wherein at least one of the local servers and 
the enterprise server sends state information to the client. 

5. The mediod of claim 1 or 2, wherem the enterprise server sends state 
information to the local server. 
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6. Hie method of claim 1 or 2, i^erein die local server communicates the 
local user ID to the enterprise s^er by transmitting a URL which is prefi^ly 
transpar^t to the user. 

5 7. The method of claim 1 or 2, wherein the local ID of the user is known 

only to the local server which assigned the local ID. 

8. The method of claim 2, wherein the local user profile is communicated to 
the enterprise server at predetermined times and/or when a number of changes in the 
local user profile as a result of recent user interaction are greater than a predetermined 
number of changes. 

9. The method of claim 2, wherein the global ID is assigned to the user 
when the user directly accesses the enterprise server or when the local sctvct first 
communicates the local ID of the user to the enterprise server. 

10. A method for building a current user interest profile of an interest 
category, comprising: 

building a session record of a user; 
computing firom the session record a contents interest profile; 
combining the contents interest profile with a weighted historical interest 
profile to form the current user mterest profile. 

11. Hie method of claim 2 or 10, ^^dierein a global mterest profile is 
established for each user of at least a subset of users and the global interest profiles 
between difierent users are compared. 

12. The method of claim 11, wherein at least one score is computed for a 
user and the score is an absolute number score or a normalized score. 

1 3. Hie method of claim 1 2, wherein the score of one user is compared to a 
corresponding score of another user. 
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14. The method of claim 2 or 10, wherein a local user profile is established 
incrementally by adding information about a most recent user interaction to a legacy 
user profile stored at the local server. 

15. The method of claim 2 or 10, wherein the user profile is processed in real 

time. 



1 6. The method of claim 2 or 10, wherein the user profile is weighted 
according to the recency of the user interaction. 

17. The method of claim 2 or 10, wherem the global user profile is updated 
in real time and/or in a single pass. 

18. The method of claim 12, wherein tfie cuirent user interest score is based 
on a weighted computing time relative to a viewing time of flie interest category. 

1 9. The method of claim 12, wherein the current user score is built 
incrementally torn a sequence of scaled session records, 

20. The method of ckiim 12, wherein the user profile and/or the user score 
comprise demographic and psychogr^)hic information. 

21. A coqiputer program, residing on a computer-readable medium, 
comprising instructions for carrying out the method according to at least one of claims 1 
to 20. 

22. Apparatus comprising a computer-readable storage medium tangibly 
embodying program instructions for carrying out the method according to at least one of 
claims 1 to 20. 
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